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This  paper  presents  a  novel,  soft  computing  based  solution  to  a  complex  optimal  control  or  dynamic 
optimization  problem  that  requires  the  solution  to  be  available  in  real-time.  The  complexities  in  this 
problem  of  optimal  guidance  of  interceptors  launched  with  high  initial  heading  errors  include  the  more 
involved  physics  of  a  three  dimensional  missile-target  engagement,  and  those  posed  by  the  assumption 
of  a  realistic  dynamic  model  such  as  time-varying  missile  speed,  thrust,  drag  and  mass,  besides  gravity, 
and  upper  bound  on  the  lateral  acceleration.  The  classic,  pure  proportional  navigation  law  is  augmented 
with  a  polynomial  function  of  the  heading  error,  and  the  values  of  the  coefficients  of  the  polynomial  are 
determined  using  differential  evolution  (DE).  The  performance  of  the  proposed  DE  enhanced  guidance 
law  is  compared  against  the  existing  conventional  laws  in  the  literature,  on  the  criteria  of  time  and  energy 
optimality,  peak  lateral  acceleration  demanded,  terminal  speed  and  robustness  to  unanticipated  target 
maneuvers,  to  illustrate  the  superiority  of  the  proposed  law. 
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1.  Introduction 

Interception  of  an  aerial  target  by  missiles  or  interceptors  is  an 
important  practical  problem.  However,  this  is  far  from  amenable 
to  any  easy  solutions  due  to  the  inherent  complexities  of  the  prob¬ 
lem.  Realistic  models  of  missile  dynamics  must  include  the  effect 
of  time-varying  missile  speed,  thrust,  drag  and  mass,  besides  grav¬ 
ity.  The  drag  in  turn  is  a  function  of  the  missile  speed,  atmospheric 
density,  and  lateral  acceleration  (latax).  Another  important  practi¬ 
cal  constraint  is  the  upper  bound  on  the  latax.  Two  simplifications 
resorted  to  in  the  literature  are  (i)  consider  only  the  kinematics,  and 
(ii)  separate  the  three  dimensional  (3-D)  engagement  into  two  pla¬ 
nar  (or  two  dimensional,  or  2-D)  kinematic  models  in  the  horizontal 
and  vertical  plane.  Enabled  by  such  gross  simplifications,  analyti¬ 
cal  or  closed  form  guidance  laws  that  can  be  easily  implemented 
at  every  time  instant,  and  with  minimum  computation,  are  formu¬ 
lated  or  derived,  as  in  [56,38],  Given  the  much  higher  complexity 
of  analysis  and  design  in  the  3-D  space,  guidance  papers  pertaining 
to  3-D  engagements  are  far  fewer  in  number,  as  compared  to  those 
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for  2-D  engagements,  in  spite  of  the  fact  that  the  first  paper  on  3-D 
guidance  was  published  as  early  as  in  the  1950s  itself  [1], 

Even  for  the  simpler  planar  kinematic  model,  interception  is  not 
an  easy  task  if  the  initial  engagement  geometry  has  high  heading 
errors.  Determination  of  the  guidance  law  in  a  high  heading  error 
engagement,  subject  to  the  kinematic  equation  alone,  has  been  the 
subject  of  many  papers  and  techniques,  and  some  of  these  tech¬ 
niques  are  based  on  methods  of  nonlinear  control  design.  Some 
examples  are:  feedback  linearization  [2],  relative  heading  error 
angle  concept  [22],  state-dependent  Riccati  equation  technique  [9], 
extended  proportional  navigation  [54],  variable  structure  control 
[18,17],  and,  a  combination  of  LOS  rate  and  heading  error  [49],  Of 
these,  only  Cloutier  and  Stansbery  [9]  and  Taur  [49]  deal  with  3- 
D  kinematic  engagements;  the  rest  consider  only  2-D  kinematic 
models. 

The  scenario  becomes  even  more  complicated  if  one  imposes  the 
additional  requirement  of  optimality.  Analytical  optimal  guidance 
laws  derived  from  kinematic  models  require  time-to-go  parameter 
for  their  implementation,  due  to  the  free  final-time  nature  of  the 
problem.  Consequently,  accurate  estimation  of  time-to-go  itself  has 
been  another  continuing  topic  of  research,  as  in  Ryoo  et  al.  [35,36] 
and  Tahk  et  al.  [48], 

The  nonlinear  optimal  control  problem  applicable  to  high  head¬ 
ing  error  initial  geometries  is  a  two  point  boundary  value  problem 
that  is  amenable  to  only  numerical  solutions  and  demand  consid¬ 
erable  computational  effort,  thereby  not  being  implementable  in 
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real-time.  Further,  the  performance  of  these  numerical  solutions 
deteriorates  for  even  small  deviations  between  the  model  assumed 
for  design  and  the  actual  plant  on  which  they  are  applied,  due  to 
the  inherently  open  loop  nature  of  these  solutions.  This  makes  the 
case  for  finding  closed  loop  solutions  or  guidance  laws,  that  are 
implementable  in  real-time,  using  a  more  realistic  dynamic  model, 
without  resorting  to  the  unrealistic  simplifying  assumptions  for  the 
sake  of  mathematical  tractability. 

The  present  paper  formulates  and  studies  the  performance  of 
an  online-implementable  optimal  guidance  law  based  on  a  realis¬ 
tic  model  of  engagement  that  explicitly  includes  all  the  constraints 
mentioned  in  the  beginning  of  Section  1,  in  3-D  physical  space, 
for  high  initial  heading  errors.  Except  for  treating  the  missile  and 
target  as  point  masses,  no  other  approximations  are  made.  The 
large  number  of  inequality  constraints  in  the  formulation  itself 
makes  it  very  difficult  to  obtain  an  analytical  or  closed  form  solu¬ 
tion,  making  numerical  solution  the  only  option.  Any  optimal 
control  problem,  linear  or  nonlinear,  with  any  number  of  com¬ 
patible  equality  and  inequality  constraints  on  state  and  controls 
can  be  solved  by  dynamic  programming  [28],  Another  method 
of  solution  of  the  nonlinear  optimal  control  problem  is  by  con¬ 
verting  it  to  a  nonlinear  programming  (NLP)  problem  and  solving 
the  NLP  problem,  as  in  Cruz  et  al.  [10],  A  method  that  has  been 
applied  in  optimal  missile  guidance,  for  a  realistic  model  -  though 
only  2-D  -  is  Imado,  Kuroda  and  Miwa  [16],  However,  as  has 
been  acknowledged  sometimes  by  the  authors  themselves  [16], 
all  these  are  too  computationally  intensive  for  implementation  in 
real-time. 

In  this  paper,  the  optimal  guidance  problem  is  posed  as  a  low 
dimensional  polynomial  optimization  problem,  and  a  numerical 
solution  to  this  problem,  computable  in  real-time,  is  obtained. 
The  basic  pure  proportional  navigation  (PPN)  law  is  augmented 
with  a  polynomial  function  of  the  heading  error,  and  optimiza¬ 
tion  is  achieved  by  using  DE  to  find  the  values  of  the  coefficients 
of  the  polynomial.  The  proposed  law  is  then  compared  with  the 
all-aspect  proportional  navigation  (AAPN)  [39],  and  PPN  laws  on 
the  following  criteria:  energy  and  time  optimality,  the  peak  latax 
demanded,  terminal  speed,  and  robustness  to  off-nominal  condi¬ 
tions  in  the  form  of  unanticipated  target  maneuvers.  The  AAPN 
law  of  Sim  et  al.  [39]  used  the  same  structure  as  in  the  guidance 
law  proposed  in  the  present  paper,  but  the  solution  proposed  was 
applicable  for  the  given  initial  conditions  only,  and  needed  to  be 
recomputed  for  every  different  set  of  initial  condition,  as  pointed 
out  by  the  authors  themselves.  It  also  assumed  the  kinematic  model 
of  engagement.  Moreover,  the  solution  methodology  was  time  con¬ 
suming  and  hence  unsuitable  for  online  implementation.  The  law 
proposed  in  the  present  paper  is  applicable  to  any  initial  conditions, 
for  a  realistic  dynamic  model  of  engagement,  and  is  also  online 
implementable. 

In  the  previous  work  of  the  authors  [31],  a  DE  based  opti¬ 
mal  guidance  law  was  proposed  for  a  two  dimensional  (2-D) 
kinematic  model  of  missile-target  engagement.  It  was  a  proof 
of  concept  that,  for  optimal  missile  guidance,  evolutionary  algo¬ 
rithms  like  the  DE  could  serve  as  an  alternative  to  the  optimal 
control  based  methods,  thereby  circumventing  the  various  diffi¬ 
culties  posed  by  these  methods.  The  work  described  in  the  present 
paper  is  much  more  elaborate,  since  it  uses  a  3-D  realistic  engage¬ 
ment  model  that  includes  missile  dynamics  and  aerodynamic 
effects. 

Compared  to  the  previous  work  of  the  authors  [31]  which 
considered  engagements  with  a  simple,  planar  kinematic  model, 
the  present  work  assumes  a  3-D  realistic  model  that  introduces 
the  following  complications:  First,  the  state  variables  are  subject 
to  numerous  interdependent  equality  and  inequality  constraints. 
Since  these  are  indirect  functions  of  the  altitude,  they  depend  on 
the  vertical  position  of  the  missile  in  the  inertial  frame  of  reference. 


Secondly,  the  latax  is  applied  by  the  missile  in  its  own  body  frame  of 
reference,  and  the  LOS  rates  are  measured  and  heading  error  angles 
are  readily  computed  in  the  LOS  frame  of  reference.  This  entails  con¬ 
version  from  one  frame  of  reference  to  another.  Thirdly,  unlike  in 
the  kinematic  model  in  which  the  missile  speed  is  constant,  mis¬ 
sile  speed  also  enters  the  state  as  a  variable,  and  is  characterized 
by  sharp  discontinuities  as  the  missile  thrust  switches  from  boost 
to  sustain,  and  sustain  to  coast  phases. 

Fourthly,  a  very  serious  practical  difficulty  is  that,  with  the 
dynamics  also  taken  into  account  in  the  realistic  model,  the  crit¬ 
ical  constraining  factor  is  the  thrust  that  is  available  only  for  a 
limited  time  period.  The  missile  speed  that  is  highly  dependent 
on  the  thrust  begins  to  fall  off  from  the  end  of  the  boost  phase, 
especially  if  the  missile  is  still  applying  a  large  latax.  Any  practi¬ 
cal  interception  is  considered  to  be  successful  only  if  the  missile 
speed  is  high  enough  just  before  interception,  so  that  the  mis¬ 
sile  is  in  a  position  to  engage  the  target  in  the  end  game  scenario. 
Hence,  for  a  successful  interception,  the  missile  thrust  should  still 
be  in  the  sustain  phase,  or  at  least  not  be  too  long  into  the  coast 
phase. 

The  guidance  law  designed  must  take  into  account  all  these 
factors,  and  hence  moving  from  a  2-D  kinematic  model  to  a  3-D 
realistic  model  is  far  from  a  mere  extension  of  the  guidance  law 
designed  for  the  2-D  kinematic  model.  The  fifth  and  final  compli¬ 
cation  that  is  of  crucial  importance  is,  the  solution  or  the  guidance 
law  that  involves  the  four  complications  described  above,  has  to 
be  available  or  computable  in  real  time,  to  be  of  any  practical 
relevance.  The  main  contribution  of  this  paper  is  in  providing  a 
real-time  solution  that  is  online  implementable  to  this  challeng¬ 
ing  problem,  using  differential  evolution  as  the  enabling  method. 
Since  the  problem  is  a  real  world  one,  there  is  no  generic,  elegant 
solution;  problem  specific  knowledge  has  to  be  incorporated,  and 
real  world  complications  have  to  be  addressed,  as  described  in  the 
paper. 

The  paper  is  organized  as  follows:  Section  2  presents  the  math¬ 
ematical  preliminaries,  the  3-D  PPN,  and  all-aspect  proportional 
navigation  (AAPN)  laws,  and  the  development  of  the  3-D  counter¬ 
part  of  the  2-D  differential  evolution  tuned  all-aspect  proportional 
navigation  (DEPN-2D)  law.  Section  3  explains  the  DE  algorithm 
used  in  the  paper,  Section  4  describes  the  design  of  the  proposed  3- 
D  guidance  law  using  DE,  Section  5  presents  the  simulation  results 
and  comparison  of  the  proposed  law  with  the  other  guidance  laws, 
and  is  followed  by  concluding  remarks  in  Section  6. 


2.  Vehicle  models  and  guidance  laws  in  3-D  engagement 
scenario 

Using  a  Cartesian  co-ordinate  representation  in  an  inertial  frame 
of  reference,  the  three  dimensional  (3-D)  point-mass  equations  of 
motion  for  the  missile  and  the  target  are  modelled  as  follows: 

Missile: 


' Xm ' 

Vmcosymcos<t>m 

y'm 

Vmcosyms\n<pm 

Z'm 

Vmsin  ym 

4>m 

aym/(Vmcosym) 

Ym 

(apm  -gcosym)/Vm 

_Vm_ 

(r  -  D)/m  - g sinym 

T.  Raghunathan,  D.  Chose  /  Applied  Soft  Computing  16(2014)20-33 


Target: 


'  VtcosytCos  4>t " 

yt 

Vtcosytsin0t 

zt 

= 

Vtsinj/t 

<i>t 

ayt/(Vtcosyt) 

Yt 

apt/Vt 

A x  =  xt-xm,  A y=yt-ym,  A z  =  zt-zmr  = 
y/ (Ax2  +  Ay2  +  Az))yl  =  tan-'(  Az/^/(Ax2  +  Ay2)),  <pL  = 
tan_1(Ay/Ax)  where  Vm  is  the  missile  speed,  F  the  time-varying 
thrust,  D  the  drag,  m  the  mass  of  the  missile,  and  g  the  acceleration 
due  to  gravity.  aym,  apm,  ayt  and  apt  are  the  yaw  and  pitch  latax’s 
of  the  missile  and  target,  respectively,  and  am  represents  the 
maximum  permissible  latax  of  the  missile.  The  angles  t/>m,  ym,  tpt, 
yt  are  Euler  angles  made  by  the  missile  and  the  target  (Fig.  1 ).  The 
target  speed  Vt  is  assumed  to  be  constant. 


2.1.  The  3-D  PPN  law 


where  is  the  missile  latax  vector  in  the  LOS  frame  of  reference. 

For  practical  implementation,  this  latax  needs  to  be  converted 
to  the  missile  body  frame  of  reference  by  using 

aM  =  TlmaM  (8) 

where  the  co-ordinate  transformation  matrix  TLm  that  achieves  the 
transformation  from  the  LOS  frame  of  reference  to  the  missile  body 
frame  of  reference  is  given  by 


Tim 


'cos  l//mCOS0m 
— sin^m 


sini/fmcos  9m  sinOm 
COS  x/fm  0 

-sim/fm  sin  9m  cos  6m 


(9) 


The  latax  ajj}  in  (8)  has  two  components  in  the  yaw  and  pitch  plane 
of  the  missile: 


aM  =  aymjm  +  Upmkm 

(10) 

aym  =  -NVmXy  sin0msim/m  +  NVmXzcos9m 

(11) 

apm  =  -NVmXy  cos  i//m 

(12) 

^  Vmsin0m  -  Vt  sinft 

(13) 

^  Vtcos 9t  sim/q  -  Vmcos  9ms\nxjfm 

(14) 

where  aym  and  apm  are  along  the  Ym  and  Zm  axes  of  the  missile  frame 
of  reference,  and  the  angles  6t  and  i//t  are  defined  with  respect  to 
the  LOS  frame  of  reference  (Fig.  1 ). 


2.2.  The  3-D  differential  evolution  tuned  all-aspect  proportional 
navigation  law 

The  PPN  law  that  meets  the  essential  requirement  of  inter- 
ceptability  can  be  modified  to  meet  the  desirable  requirement  of 
optimality  by  augmenting  it  with  an  additional  term  for  this  pur¬ 
pose,  leading  to  the  AAPN  law.  The  AAPN  guidance  law  for  2-D 
engagement  is  given  by  [39] 


A  guidance  law  that  has  been  proved  to  guarantee  interception 
of  the  target,  whether  maneuvering  or  non-maneuvering,  -  for  the 
kinematic  model  -  is  the  pure  proportional  navigation  (PPN)  law 
[13,14],  The  3-D  PPN  law  is  given  by  [41  ] 

aM  =  NQL  x  VM  (4) 

where  aM  is  the  missile  latax  vector,  N  the  navigation  constant,  T2L 
the  angular  velocity  of  the  LOS,  and  Vm  the  missile  velocity  vector. 
In  the  LOS  frame  of  reference, 

*12/  =  Xx  ii.  +  Ayji  +  X7  kj  (5) 

vLM  =  v!nxh  +  v!nyjL  +  v!nzkL  (6) 

where  Xx,  Xy,  Xz  are  the  Xi,  Yi,  Zi  components,  respectively,  of  the 
LOS  angular  velocity  vector,  iL,  jL,  kL  are  the  unit  vectors  along, 
respectively,  to  XL,  YL,  ZL  axes,  is  the  missile  velocity  vector 
VM  in  LOS  frame  of  reference,  and  Vj^x,  Vj^y,  Vj^  are  the  missile 
velocity  vector  components,  respectively,  along  Xl,  Yi,  Zi  axes,  in 
the  LOS  frame  of  reference,  and  given  by  Vj^x  =  Vmcos9mcos\lsm, 
Vmy  =  Vmcos9ms\mj/m,  =  Vmsin 9m,  with  the  angles  9m  and  i/rm 
defined  with  respect  to  the  LOS  (Fig.  1 ). 

However,  Xx  cannot  be  measured  by  the  conventional  on-board 
seeker  [41  ].  Hence,  using  only  the  last  two  terms  of  (5),  and  (6)  in 
(4), 

a‘M  =  N{(Oy  -  V^yXz)iL  +  vy*/t  -  Oyki.)  (7) 


am,2D  =  NVmX+f(h,  Vmti0)  (15) 

where /is  a  polynomial  function  of  the  heading  error  angle  h,  and 
Vrati0  =  VtIVm.  Since / is  far  less  sensitive  to  Vrati0  as  compared  to  h 
[39], /reduces  to 

f  =  ah  +  bh2  +  ch 3  (16) 

With  this,  interception-guaranteed  optimality  was  sought  to  be 
ensured,  for  the  assumed  kinematic  model.  However,  the  issue  of 
determining  a,  b,  and  c  was  not  a  simple  task  that  could  be  achieved 
in  real-time.  It  was  calculated  offline  in  Sim  et  al.  [39],  using  a  pro¬ 
tracted  three  step  procedure.  A  simple  one  step  procedure  that  was 
implementable  online  was  proposed  by  Raghunathan  and  Ghose 
[31],  and  named  the  DE-AAPN  law,  and  referred  to  as  the  DEPN- 
2D  law,  to  distinguish  it  from  the  3D  counterpart  proposed  in  this 
paper.  The  DEPN-2D  law  uses  DE  to  find  out  the  values  of  the  coef¬ 
ficients  a,  b  and  c. 

As  with  the  latax  vector  in  the  3-D  PPN  law  given  by  (10),  the 
3-D  equivalent  of  the  DEPN-2D  law  is  given  by  two  components 

A M  =  Ay/rdm  +  Apmkm  (17) 

The  assumption  made  here  is  the  same  as  in  (10):  the  missile  accel¬ 
eration  comprises  only  lateral  acceleration  (latax),  and  contains  no 
component  along  the  longitudinal  direction. 
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The  pitch  and  the  yaw  latax  components  Aym  and  Apm  in  ( 1 7 )  are 
given  by 


APm=apm+fp(hp)(jpj 

(18) 

Aym  =  Oym  +fy{hy) 

(19) 

fp  =  dphp  +  bphp  +  Cphp 

(20) 

fy  =  ayhy  +  byhy  +  cyhy 

(21) 

where  hp  and  hy  are  the  heading  errors  in  the  pitch  and  yaw  planes 
in  the  LOS  frame  of  reference.  This  is  in  contrast  to  the  2-D  case,  in 
which  only  a  single  heading  error  exists  (16).  As  in  the  2-D  case,  the 
DEPN-3D  law  involves  finding  out  the  values  of  the  coefficients  ap, 
bP,  cp,  ay,  by  and  cy  using  DE. 

The  angles  9m ,  r/rm  ,9t,i/ft,by  and  hp  in  ( 1 1 )-( 1 4 )  and  ( 1 8)-(2 1 )  are 
in  the  LOS  frame  of  reference,  whereas  the  states  of  the  missile  and 
the  target  being  subject  to  altitude  dependent  practical  constraints 
like  density  and  temperature,  have  to  be  obtained  by  integration  of 
( 1 )  and  (3 )  in  the  inertial  frame  of  reference.  This  needs  co-ordinate 
transformation  from  inertial  to  the  LOS  frame  to  be  done  at  every 
time  instant  of  the  simulation. 

Vm  =  Wm  (22) 

=  TilV't  (23) 

where  \flM  and  Vp  are  the  missile  and  target  velocity  vectors  Vm 
and  V7-  in  the  inertial  frame  of  reference,  given  by 

Vm  =  VlnJi  +  Kyji  +  V^ki  (24) 

V't  =  V^i,  +  V[yj,  +  V’zk,  (25) 

where  i/Ji,  k;  are  the  unit  vectors  along  X/,  Y;,  Z(  axes,  respectively, 
and  Vlmy,  Vlmz  are  the  missile  velocity  vector  components, 
respectively,  along  Xj,  Y(,  Z(  axes,  in  the  inertial  frame  of  refer¬ 
ence,  and  given  by  =  Vmcosymcos0m,  !/^y  =  Vmcos  ymsin0m, 
V]mz  =  Vmsinym.  Similarly,  =  Vtcos  ytc os0t,  Vjy  =  Vtcos  ytsin0t, 
V b  =  Vtsinyt. 

The  co-ordinate  transformation  matrix  that  achieves  the  trans¬ 
formation  from  inertial  to  the  LOS  frame  of  reference  Tp  is  given 
by 

’  cos  tpicosyi  sin0i.cos}4  sinyi.  ’ 


(b)  Collisioit  ragl  e  in  hfeon  tal  ( XLYL )  plan 
Fig.  2.  Collision  triangles  in  3-D  space  in  LOS  frame  of  reference. 


M,)-sln"{v„(SU)} 


Til 


-sin (j>L  cos  (pi  0 

-cos0tsinyL  -sin0tsinyt  cos  yi 


(26)  3.  Evolutionary  algorithms  and  the  differential  evolution 

algorithm 


Tu  =  t;l 

(27) 

VT  =  V&L  +  V&L  + 

(28) 

where  =  Vtoos9tcos  iftt,  =  Vtcos  0tsin0t,  and  V\z  =  Vtsin0t. 

The  angles  hp,  hy  in  (18)  and  (19)  are  given  by  (Fig.  2(a)  and  (b)) 

hp(t)  =  9m(t)-Lp(t) 

(29) 

hy(t)  =  —  Ly(t) 

(30) 

0m(t)  =  tan-1(V4z(t)V^(t)) 

(31) 

vLy(t)  =  yj  { vUt))2  +  { l4y(t))2 

(32) 

Lp(t)  =  sin"1  (V4(t)Vm(t)) 

(33) 

wt)  =  tan-1  {Vlmy  (tyvUt)) 

(34) 

A  later  addition  to  the  EA  family,  the  differential  evolution  (DE) 
was  proposed  by  Price  and  Storn  [45,46,42,30,47,29,43],  A  popular 
member  of  the  evolutionary  algorithm  (EA)  family,  the  DE  owes 
its  popularity  to  its  simplicity  of  implementation,  and  performance 
in  terms  of  reproducibility  of  results,  and  relative  insensitivity  to 
change  in  its  tuning  parameters,  that  have  been  proven  at  several 
congress  on  evolutionary  computation  (CEC)  competitions  [  1 2 ].  An 
empirical  study  of  some  basic  versions  of  the  particle  swarm  opti¬ 
mization  (PSO),  genetic  algorithm  (GA)  and  DE  algorithms,  found 
DE  to  be  the  best  performer  on  a  test  suite  of  static  optimiza¬ 
tion  benchmark  problems,  and  regarded  the  DE  as  an  excellent 
first  choice  when  faced  with  a  new  (static)  optimization  prob¬ 
lem  [51],  Another  comparison  of  the  evolutionary  computation 
(EC)  algorithms  of  the  GA,  DE,  evolutionary  programming  (EP), 
evolutionary  strategy  (ES),  ant  colony  optimization  (ACO),  PSO, 
and  the  other  algorithms  of  Tabu  search,  simulated  annealing,  and 
hybrid  approach  (HA)  is  found  in  Kannan  et  al.  [  1 9].  These  methods 
have  been  applied  to  the  real-world,  highly  constrained,  nonlinear, 
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discrete  optimization  problem  of  generation  expansion  planning, 
and  it  was  found  that,  excepting  the  HA,  the  DE  outperformed  all  the 
others  in  terms  of  success  rate  (SR),  defined  as  the  number  of  times 
the  optimal  solution  was  achieved  to  the  total  number  of  test  runs. 
It  was  opined  that  this  was  due  to  the  nature  of  mutation  employed 
by  DE.  This  provides  DE  with  one  of  its  main  assets,  ‘contour  match¬ 
ing’  [44],  which  refers  to  the  self-adaptive  property  of  the  vector 
population  to  automatically  explore  the  most  promising  regions  of 
the  solution  space,  once  they  are  detected.  A  good  survey  of  DE 
and  the  state-of-the-art  including  its  major  variants,  application 
to  difficult  optimization  problems  and  engineering  applications  is 
covered  in  Das  and  Suganthan  [12]  and  Chakraborthy  [4],  A  generic 
explanation  of  how  the  DE  algorithm  works  and  the  pseudocode 
can  be  found  in  Raghunathan  and  Ghose  [31], 

3.1.  EC,  DE  and  the  optimal  control  problem 

Given  the  inherent  advantages  of  EC  over  other  gradient  based 
optimization  methods,  the  rich  potential  of  EC  methods  for  solv¬ 
ing  optimal  control  problems  were  recognized  in  the  1990s  itself 
[26],  [25]  and  [21],  As  these  were  simply  demonstrations  of  proof 
of  concept,  most  of  the  problems  solved  in  these  were  simple  opti¬ 
mal  control  problems  for  which  analytic  solutions  existed.  This  was 
followed  by  problems  of  increasing  sophistication  like  in  Seywald 
et  al.  [37]  and  Sim  et  al.  [39],  in  which  the  GA  was  used  to  provide 
good  initial  guess  solution  for  the  methods  used  therein.  Other  than 
those  of  the  authors  of  the  present  paper,  probably  the  only  appli¬ 
cations  of  DE  to  optimal  control  problems  are  Chiou  and  Wang  [7] 
and  Cruz  et  al.  [10],  in  which  DE  is  applied  to  get  offline  solutions 
to  multimodal  problems  for  which  the  solutions  are  known  before¬ 
hand.  In  Sim  et  al.  [39],  the  optimal  control  problem  solved  was 
an  optimal  missile  guidance  problem  for  a  kinematic  model.  Here 
too,  the  solution  was  obtained  offline,  based  on  an  existing  solution 
obtained  by  a  different  method.  To  the  best  of  our  knowledge,  the 
first  online  implementable  solution  to  the  optimal  missile  guidance 
problem  obtained  by  an  EA  was  in  Raghunathan  and  Ghose  [31  ].  In 
this  work,  the  problem  solved  offline  in  Sim  et  al.  [39]  was  solved 
online  in  real-time  using  DE,  since  it  was  felt  that,  given  the  inher¬ 
ent  simplicity  of  the  DE,  it  would  be  the  most  suitable  EA  for  online 
implementation.  The  present  work  too  uses  DE  for  the  same  reason, 
but  for  the  more  complex,  realistic,  3-D  missile-target  engagement 
model. 


4.  Design  of  DEPN-3D  law  using  DE 

Several  empirical  studies  on  the  performance  of  the  different 
DE  variants  have  appeared  in  the  literature  over  the  years.  Differ¬ 
ent  DE  variants,  both  classic  and  more  sophisticated,  are  applied 
to  a  suite  of  benchmark  functions,  and  the  results  summarized. 
These  functions  varied  in  parameter  dependence,  dimensionality, 
modality  and  noise.  This  is  the  approach  followed  in,  for  instance, 
Mezura-Montes  et  al.  [24],  Brest  et  al.  [3],  Das  et  al.  [11]  and  Zhang 
and  Sanderson  [57],  But  these  are  all  static  optimization  problems, 
and  applications  of  DE  to  dynamic  optimization  or  optimal  control 
problems  itself  are  very  few  (cited  in  Section  3.1 ),  and  no  such  study 
has  been  performed  for  optimal  control  problems,  to  the  best  of  the 
authors’  knowledge. 

Obviously,  no  optimization  method  can  perform  equally  well 
on  all  optimization  problems,  as  explained  by  the  No  Free  Lunch 
(NFL)  Theorem  [52],  Consequently,  the  method  that  works  best  for 
any  particular  problem  has  to  be  found  by  trial  and  error  by  trying 
out  all  the  methods.  Applying  this  to  the  DE  variants,  the  variant 
that  works  best  for  the  optimal  control  problem  or  optimal  missile 
guidance  has  to  be  found  out  empirically,  by  trying  out  all  of  them. 
This  is  outside  the  scope  of  the  present  work,  and  may  be  considered 


for  some  future  work.  The  scope  of  the  present  paper  is  limited  to 
application  of  classic  DE  to  optimal  missile  guidance  using  a  realistic 
3-D  model. 

To  apply  DE  to  the  missile  guidance  problem,  the  values  of  the 
DE  parameters  are  first  determined.  Several  claims  and  counter¬ 
claims  are  found  in  the  DE  literature  concerning  the  rules  for  tuning 
of  the  DE  parameters,  but  these  are  not  backed  by  experimental 
justifications  [12],  Hence  the  approach  adopted  here  is  to  begin 
with  a  few  guidelines  available  in  DE  literature,  and  fine-tune  the 
algorithm  by  executing  a  few  initial  sample  runs. 


4.1.  Some  issues  in  application  ofDE  to  missile  guidance 

4.1.1.  Choice  of  tuning  parameters  NP,Cr  and  F 

A  detailed  study  of  the  best  values  of  F  and  Cr  would  call  for  a 
theoretical  analysis  as  well  as  empirical  study,  as  in  Zaharie  [55], 
Since  such  a  study  is  out  of  the  scope  of  this  paper,  a  more  prac¬ 
tical  and  immediate  way  is  adopted.  Storn  [44]  gives  a  ‘rule  of 
thumb’  guideline  of  the  approximate  range  from  which  to  begin 
while  trying  to  solve  a  problem  by  DE:  mutation  (or  differentia¬ 
tion)  constant  Fe  [0.5, 1.0],  crossover  ratio/constant  Cr  e  [0.8, 1.0], 
population  size  Np  =  1 0D,  where  D  is  the  number  of  unknown  vari¬ 
ables,  or  the  dimensionality  of  the  problem.  There  also  exist  the 
default  values  from  which  to  begin  [43  ] :  Cr  =  0.9,  and  F=  0.8.  Given 
the  unknown  solution  surface  topology,  this  default,  high  value  of  F 
that  leans  towards  exploration,  rather  than  exploitation,  is  retained. 
The  high  value  of  Cr  also  implies  that  the  trial  vector  is  more  of 
a  mutant  than  parent,  but  the  problem  is  found  to  work  well  for 
this  default  value  as  well.  As  it  was  observed  that  the  problem  was 
not  sensitive  to  reasonably  small  variations  in  the  values  of  these 
DE  parameters,  the  default  values  were  used,  without  any  further 
study  of  this  aspect  of  the  algorithm. 

The  suggested  value  of  NP=  10D  necessitates  10D  cost  function 
evaluations  in  each  generation.  Since  an  indispensable  requirement 
of  the  missile  guidance  problem  is  online  implementability,  and 
cost  function  evaluation  is  found  to  be  the  most  computationally 
intensive  of  all  steps  in  the  whole  procedure,  the  population  size  NP 
is  sought  to  be  reduced  as  the  first  step.  The  mutation  and  crossover 
operators  of  DE  impose  an  absolute  lower  limit  ( NP  ft  4),  but  since 
this  is  too  low  for  a  multi-point  directed  random  search  algorithm 
like  DE,  NP  =  4 D,  arrived  at  by  trial  and  error,  and  traded  off  against 
the  maximum  number  of  generations  Gmax,  as  in  Raghunathan  and 
Ghose  [31], 


4.1.2.  DE  variant  used 

The  classic  DE  variant  used  is  DE/best/1 /bin,  as  against 
DE/rand/l/bin,  another  choice  that  works  equally  well  for  the  prob¬ 
lem  in  this  paper.  Here  the  ‘best’  or  ‘rand’  in  the  above  notation  [44] 
refers  to  the  base  vector  that  is  perturbed  in  mutation,  ‘best’  means 
the  fittest  in  the  generation,  and  ‘rand’  means  that  the  vector  is 
chosen  randomly.  The  ‘1’  refers  to  the  number  of  difference  vec¬ 
tors  used,  and  ‘bin’  refers  to  the  binomial  crossover  scheme.  More 
details  and  comparison  of  these  variants  can  be  found  in  Stom  [44] 
and  Price  [29], 


4.1.3.  Bounds  on  the  solution  space 

The  use  of  [0,1  ]  as  the  bounds  for  |a|,  |b|  and  |c|  in  the  AAPN  law 
of  Sim  et  al.  [39]  assumes  that  the  scaling  factor  V%/r  in  (15)  scales 
the  values  of  a,  b  and  c  to  within  ±1.  Through  simulations,  it  was 
found  that  better  and  faster  solutions  are  obtained  by  enlarging  the 
solution  space  to  beyond  ±1,  probably  because  the  above  assump¬ 
tion  does  not  always  hold,  especially  when  large  heading  errors 
are  involved.  On  the  other  hand,  too  large  a  range  would  mean 
needlessly  enlarging  the  search  space,  something  that  is  not  in  the 
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interest  of  online  implementation.  By  trial  and  error,  the  bounds 
for  the  solution  space  is  fixed  as  [-  5,  5]. 

4.3.4.  The  fitness  function 

The  DEPN-3D  law  solves  the  following  optimal  control  problem 

(36, 

subject  to  (1 )— (3) 

where  Am  =  ||AmII  =  \/{Apm  +  Ajm),  with  Apm  and  Aym  replacing 
apm  and  aym,  respectively,  in  (1)  and  (2).  In  EA  terminology,  the  J 
used  in  (36)  is  the  fitness  function  used  for  evaluation  of  the  trial 
solutions. 


4.3.5.  Latax  saturation 

The  latax  saturation  that  is  represented  by  the  inequality  con¬ 
straint  (2)  has  to  be  implemented  on  two  separately  computed 
components  (18)  and  ( 1 9 ).  A  practical  way  of  implementing  it  that 
is  used  in  this  paper  is 


y/V$m?+V$n,? 


am  C^ym )  otherwise 


where  Acpm  and  Acym  are  the  pitch  and  yaw  latax  components  com¬ 
puted  by  (18)  and  (19). 


4.2.  The  DE  solution  process 
4.2.1.  Initialization 

The  initial  zeroth  generation  population  Pg=q  of  Np  trial  solu¬ 
tions  or  individuals  are  sextuples  of  {ap,  bp,  cp,  ay,  by,  cy),  generated 
randomly  within  the  range  ±  5. 

Pc=o  =  {xi,o,x2,o,  >Xnp,o!  (38) 

where 

*1,0  =  {*1,1,0’  *2,i, 0,  %,i,D.  X4,i,0,  *5,i,Q>  *6,i,o} 

=  {Op, i,0>  bp.i.th  cp,i,0>  ay,i,0>  by,i, 0>  cy,i,o) 

Pi. c=o  =  -5  +  randjl 0, 1]  x  (+5  -  (-5)) 
p  =  ap,  bp,  cp,  ay,  by  or  cy,i  =  1,2,  ...,NP 


4.2.2.  Mutation  and  crossover 

The  mutant  vector  iqG+1  is  produced  with  F=  0.8: 

vi,G+l  =  Prl.G  0.8  (pr2,C  ~  Pr3,c)  (39) 

where 


Vi.G+l  =  (Op,  bp,  Cp,  dy,  by,  Cy), 

Ps.c  =  (qp>  bP,  cp,  ay,  by,  cy). 


s  =  rl,r2,  or  r3,  and  rl  =£  r2  =£  r3  i 
The  trial  vector  u  j.  C+1  is  generated  with  Cr  =  0.9: 
J'rand,  =  int(rand,[0, 1]  x  6) 


api,  G+l  = 


bpi,  G+l  = 


{ 

I 


bpi,  G+l 
“pi.G 
bpi,  G+l 

bpi.c 


if  {rand  [0,1] 
otherwise 
if  ( rand  [0, 1  ] 
otherwise 


0.9 


0.9 


jrand  (  —  3) 

irandt  =  2) 


(40) 

(41) 


(42) 


cpi,  G+l  =  | 

Cpi,  G+l 

if  (rand[0, 1]  <  0.9  or  jrandj  =  3) 

otherwise 

(43) 

ayi,  G+l  =  | 

byi, G+l 

ayt  G 

if  {rand[ 0, 1]  <  0.9  or  jrand.  =  4) 

otherwise 

(44) 

byi,  G+l  =  | 

byi,  G+l 
.  byi,  G 

if  {rand  [0, 1]  <  0.9  or  jrand.  =  5) 

otherwise 

(45) 

cyi,  G+l  =  | 

byi,  G+l 

if  (rand[0, 1]  <  0.9  or  jrandi=  6) 

otherwise 

(46) 

ui,  G+l  =  (aj 

(47) 

If  the  trial  vector  produced  violates  any  bound  on  the  variables, 
it  is  fixed  at  the  bound  violated. 


4.2.3.  Selection 

The  values  of  J  that  the  trial  vector  ujG+1  and  the  target  vector 
Pi,G  =  (Opi,G.  bpi.c,  Cpi.c,  ayiG,  byUG,  CyiiG)  produce  are  computed.  The 
one  that  produces  lower  J  becomes  p  iG+1 ,  that  is,  a  part  of  the  next 
generation,  and  the  other  is  discarded.  By  using  this  kind  of  tourna¬ 
ment  (‘knockout’)  selection  between  every  pair  of  parent  and  child, 
DE  ensures  two  things.  One  is  that  the  diversity  of  the  population 
is  never  lost,  as  may  happen  if  the  whole  of  the  parent  population 
is  replaced  by  the  child  population,  as  in  some  evolutionary  algo¬ 
rithms  like  the  GA.  The  second  is  that,  elitism,  or  the  practice  of 
preserving  the  best  trial  solution  in  a  population  is  automatically 
ensured:  if  the  best  target  vector  is  better  than  the  competing  trial 
vector,  it  gets  passage  to  the  next  generation  child  population.  If  not, 
it  is  replaced  by  the  new  best  (trial)  vector  in  the  whole  population. 

Mutation,  crossover  and  selection  are  sequentially  carried  out 
for  G  =  l,  2,  ...Gmax  generations.  The  Gmaxth  generation  individual 
(flpf,Gmra’  b pi, Gmax  ’  cpi,cmax’  ay(,Gmm>  byi,cmm,  cyi,cmax )  that  produces 
the  least  J  is  the  best  solution  obtained  from  the  procedure. 

The  cost  or  fitness  function  J  is  computed  as  follows:  a  trial  solu¬ 
tion  xUG  =  (api  c,  bpi  G,  cpi  c,  ayi  c,  byic,  cyljG)  is  substituted  in  (20) 
and  (21)  to  find/p  and/y,  and  these  are  substituted  in  (18)  and  (19) 
to  obtain  the  latax  Ajw(t)  given  by  (17)  for  the  time  period  [to,  fyj.  For 
the  constrained  input  case,  the  latax  is  fixed  at  ±djw,  for  the  time 
segments  in  which  the  latax  exceeds  the  maximum  bound  ±  aM. 
This  constrained  latax  is  used  to  integrate  ( 1 )  and  (3)  until  the  miss 
distance  is  less  than  the  lethal  radius  assumed.  The  integration  is 
done  numerically  using  a  fourth  order  Runge-Kutta  method.  J  is 
calculated  with  the  help  of  the  t/ thus  determined,  and  Am(t),  using 
(36). 

The  application  of  DE  to  the  missile  guidance  problem  is  shown 
in  the  form  of  a  flowchart  in  Fig.  3.  It  is  worth  noting  that  this  pro¬ 
cedure  is  general  enough  to  determine  an  optimal  missile  guidance 
law  for  any  missile-target  engagement  problem,  whose  structure 
consists  of  a  set  of  tunable  parameters  that  have  to  be  arrived  at  by 
trial  and  error. 

4.3.  Online-implementation 

As  cited  in  Section  3.1 ,  the  previous  application  of  DE  to  optimal 
control  [10]  was  offline,  and  took  more  than  a  hundred  seconds  to 
compute  the  control  for  a  dynamic  process  that  was  of  less  than 
one  second  duration.  The  solution  process  for  AAPN  too,  proposed 
by  Sim  et  al.  [39]  was  offline.  Consequently,  computation  time  was 
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Initialize:  generate 
initial  population  of  PG=0  (Eq.  (38)) 
of  Nd  trial  vectors 


3X 


Apply  the  mutation  and  crossover 
operators  of  DE  (Eq.  (39)  -  (47)) 
to  produce  Np  trial  vectors 


Fig.  3.  Application  of  DE  to  determine  the  DE-tuned-AAPN  (DEPN-3D). 


not  a  constraint  in  these  references.  As  explained  in  Thangavelu  and 
Pradeep  [50]  and  Raghunathan  and  Ghose  [31],  for  dynamic  opti¬ 
mization  or  optimal  control  applications,  EAs  could  be  employed  for 
finding  solutions  online,  under  any  of  the  following  circumstances: 

(i)  the  plant  or  system  model  is  available,  and  the  dynamics  is  slow 
enough  to  allow  the  computation  time  needed  by  the  EA  to  arrive  at 
the  solution,  (ii)  a  ‘good  enough’  solution,  not  necessarily  the  best, 
is  all  that  is  needed  for  the  application  and  (iii)  massively  parallel 
implementation  is  possible,  so  that  the  best  solution  is  obtained  in 
a  small  interval  of  time. 

However,  the  missile  guidance  problem  falls  in  none  of  the 
above  three  categories.  The  dynamics  is  very  fast,  and  the  best 
solution,  the  one  that  produces  minimum  J  with  acceptable  miss 
distance,  is  the  one  that  is  acceptable.  Though  computing  capacities 
of  computers  have  been  increasing  rapidly,  the  onboard  computing 


Fig.  4.  Scheme  for  online  implementation  of  DEPN-3D. 

power  on  the  missile  is  still  not  sufficient  to  allow  massively  paral¬ 
lel  implementation.  To  overcome  the  above  difficulties,  this  paper 
proposes  a  two  loop  hybrid  control. 

Fig.  4  shows  the  two  loop  hybrid  control  scheme  for  the  online- 
implementable  DE  tuned  AAPN  (DEPN-3D)  law,  first  proposed  in 
Raghunathan  and  Ghose  [31],  and  also  used  in  the  present  work. 
The  upper  loop  that  includes  the  PPN  would  be  in  action  until  some 
intermediate,  discrete  time  instants  t,%  i  =  1 ,  2, . . . ,  t,-  <  tp  where  tf  is 
the  flight  time  computed  from  the  model  used  in  the  lower  loop. 
Within  this  time  instant,  the  DEPN-3D  law  would  compute  the  best 
solution.  In  case  no  convergence  is  achieved  within  ti  for  some 
reason,  the  upper  loop  would  continue  to  be  applied  until  t;,  i  =  2, 
3, . . .,  at  which  convergence  occurs.  Thus,  there  is  a  backup  guid¬ 
ance  law  in  the  upper  loop,  even  if  no  convergence  occurred  for  the 
entire  duration  of  the  engagement,  thereby  ensuring  at  least  the 
primary  requirement  of  capturability,  even  if  the  optimization  fails 
and  the  secondary  requirement  of  optimality  cannot  be  achieved. 
This  addresses  the  practical  problem  of  failure  of  convergence  of 
the  optimizer,  or  algorithm  adopted  for  optimization. 

4.3.1.  Reducing  the  run  time 

The  basic  methodology  of  online  implementation  is  adapted 
from  Raghunathan  and  Ghose  [31  ].  This  computationally  challeng¬ 
ing  task  of  calculating  the  DEPN-3D  guidance  law  in  real-time  is 
accomplished  by  the  following  steps: 

(i)  Increasing  the  number  of  coefficients  in  the  polynomial  func¬ 
tions  fp  and  fy  given  by  (20)  and  (21)  does  show  a  marginal 
improvement  in  the  value  of  J.  However,  since  fp  and  fy  too  need 
to  be  calculated  for  every  individual  in  every  generation,  the  num¬ 
ber  of  coefficients  in  each  of  these  equations  is  restricted  to  3,  as  in 
(16).  On  the  other  hand,  reducing  the  number  of  coefficients  to  2 
leads  to  results  that  either  vary  over  too  wide  a  range  to  be  accept¬ 
able  or  takes  too  long  to  arrive  at  results  that  are  within  acceptable 
range. 

(ii)  Even  after  using  only  3  coefficients  in  each  polynomial  func¬ 
tion,  there  are  six  unknown  coefficients  ap,  bp,  cp,  ay,  by  and  cy 
(D  =  6)  to  be  computed  in  the  cost  function  evaluation.  The  num¬ 
ber  of  unknowns  is  reduced  to  half  by  making  ap=ay  =  a,  bp  =  by  =  b, 
and  cp=cy  =  c,  to  reduce  the  computation  time  for  online  implemen¬ 
tation. 

This  also  reduces  the  population  size,  NP  =  4D  from  24  to  12. 

( iii )  Since  it  was  found  that  there  is  reasonably  good  convergence 
within  20  generations  of  the  DE,  the  DE  simulation  run  is  termi¬ 
nated  after  20  generations.  The  best  solution  triple  (a,  b,  c)  obtained 
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after  20  generations  is  used  to  calculate  the  DEPN-3D  law.  Keeping 
the  total  number  of  cost  function  evaluations  for  solving  the  prob¬ 
lem  NP  x  Gmax  constant,  one  could  also  interchange  the  values  used 
for  Np  and  Gmax,  without  increasing  the  total  computational  effort. 
While  this  does  produce  the  same  result  for  this  particular  problem, 
it  is  conjectured  that  it  is  better  to  have  the  higher  value  for  Gmax 
than  for  NP,  since  the  possibility  of  premature  convergence  is  better 
avoided  by  doing  so,  given  that  the  topography  of  the  solution  space 
is  unknown,  and  is  different  for  different  missile  trajectories,  due  to 
differences  in  altitude,  drag,  and  time  duration  of  the  trajectories. 

( iv)  The  numerical  solution  of  ( 1 )  and  ( 3 )  that  needs  to  be  solved 
as  a  part  of  fitness  evaluation  of  every  solution  vector  in  every  gen¬ 
eration  is  time  consuming.  To  alleviate  this,  the  step  size  in  the  RK-4 
function  used  to  solve  ( 1 )  and  (3 )  is  increased  to  0.1  s,  and  the  miss 
distance  for  terminating  the  simulation  is  also  increased  to  25  m. 

The  coding  and  simulation  is  done  in  MATLAB®  version  6.5.0 
and  SIMULINK®  version  5  on  a  pentium  IV,  3  GHz  computer  with 
512  MB  RAM  and  4  GB  virtual  memory.  To  achieve  fast  execution, 
the  integration  of  (1)  and  (3)  is  done  in  the  SIMULINK  environ¬ 
ment,  running  in  accelerator  mode.  Further,  the  in-built  features  of 
SIMULINK  accelerator  that  enable  even  faster  execution  are  made 
use  of.  The  rest  of  the  simulation  that  includes  applying  DE  to  the 
missile  guidance  problem  is  done  in  the  MATLAB  environment. 

5.  Implementation  and  simulation  results 

5.1.  Realistic  model  of  the  missile 

The  basic  dynamic  model  for  the  missile  that  includes  aerody¬ 
namic  effects  is  adapted  from  Cheng  and  Gupta  [6],  Kee,  Dong  and 
Siong  [20],  Ratnoo  and  Ghose  [33,34], 

The  aerodynamic  drag  D  that  appears  in  (1)  is  mod¬ 
elled  as  D  =  D0  +Dj,  where  the  zero-lift  drag  D0  =  Cd0Qs,  the 
induced  drag  D,  =  Km2a^/(Qs),  the  induced  drag  coeffi¬ 
cient  K=l/(7TAre),  the  dynamic  pressure  Q  =  jpV2,  where 
s  is  the  planform  reference  area,  Ar  the  aspect  ratio,  e  the 
efficiency  factor,  the  atmospheric  density,  p(h)  =  1.5579 
—  1.058  x  10-4h  + 3.725  x  10-9h2  —  6.0  x  10-14h3,  where  he[0, 
20,  000]  is  the  altitude. 

The  coefficient  Cdo  =  0.02,  M< 0.93  ;Cd0  =  0.02  +  0.2(M- 0.93), 
0.93  <  M  <  1 .03  ;  Cd0  =  0.04  +  0.06(M  -  1 .03 ), 

1.03  <M<  1.10 ;  Cdo  =  0.0442  +  0.007(M—  1.10),  M>1.10  where 
M  is  the  Mach  number,  and  the  reference  area  s  is  1  m2.  The  com¬ 
puted  values  of  K  are:  K=0.2,  M<1.15;K=0.2  +  0.246(M-1.15), 
M>  1.15.  The  Mach  number,  M  =  ,  R  =  288,  where  the  tem¬ 

perature  variation  with  altitude  is  given  by  T=  288.1 6  -  0.0065  h  K, 
h  <  11,  000  m ;  T=  216.66 K,  h>ll,  000 m. 

The  thrust  profile  and  the  mass  are  assumed  to  be:  r  =  12,  000 
N,  m  =  165  -4t  kg,  for  0  <  t<  10  s  ;  T  =  2000  N,  m  =  125  -It  kg,  for 
10<  t<40  s ;  F  =  0  N,  m  =  95  kg,  for  40  s<  t. 

5.2.  Missile-target  engagement  variables 

The  proposed  method  is  implemented  on  the  following  prob¬ 
lem: 

Without  loss  of  generality,  for  simplicity  and  convenience  of 
calculation,  the  initial  LOS  is  assumed  to  be  along  the  Xj  axis 
(n  =  0t  =  O).  The  other  initial  conditions  of  the  engagement  are: 
Xm=ym=yt  =  o, zm  =  zt  =  3000, xt  =  3000,  Vm  =  400,  Vt  =  200. 

The  time-weight  Wt=  104,  the  value  used  in  Raghunathan  and 
Ghose  [31  ]  and  Sim  et  al.  [39],  Though  practical  values  of  the  navi¬ 
gation  constant  N,  is  usually  set  at  between  2  and  4  (p.  361 ,  Lin  [23  ] ), 
this  value  is  chosen  to  be  3,  since  this  value  has  been  proven  to  be 
optimal  at  least  under  some  conditions  [4],  The  lethal  radius  of  the 
missile  is  assumed  to  be  1 0  m  in  the  simulations  used  for  validation. 


The  other  nominal  conditions  assumed  are  a  lag-free  guidance  sys¬ 
tem  (T=0),  and  a  15g  constraint  on  latax  (Sm  =  147  m/s2).  The 
interception  or  capture  is  deemed  to  be  successful  only  if  the  miss 
distance  falls  to  less  than  10m  without  the  missile  terminal  speed 
( Vm(tf ))  falling  to  less  than  a  cutoff  speed  of  350  m/s,  and  the  inte¬ 
gration  of  (1 )  and  (3)  is  stopped  at  this  time  instant. 

Following  in  the  footsteps  of  Morgan  et  al.  [27  ] ,  Choi  et  al.  [  8  ]  and 
Hossain  et  al.  [15],  the  guidance  law  used  for  comparison  against 
the  DEPN-3D  law  of  this  paper  is  the  PPN,  which  is  a  ready  bench¬ 
mark,  and  one  that  has  been  extensively  studied  in  the  literature 
and  believed  to  have  been  widely  implemented  in  practice.  As  the 
law  superior  to  even  the  PPN  when  target  maneuver  is  known  [56], 
the  augmented  PPN  (APPN)  (covered  in  the  Appendix  for  the  sake 
of  completeness)  too  has  been  used  as  another  benchmark  for  com¬ 
parison  in  this  paper.  The  final  benchmark  used  in  this  paper  is  the 
AAPN  of  Sim  et  al.  [39],  since  it  is  the  original  law  upon  which  the 
past  [31  ]  and  present  works  of  the  authors  are  loosely  based  on. 

Although  the  method  proposed  in  this  paper  is  online- 
implementable,  the  results  of  the  offline-implementation  of  the 
proposed  method  are  also  shown,  for  a  fair  comparison  with  the 
AAPN,  since  the  AAPN  is  an  offline  law.2  This  may  also  be  useful  in 
the  scenario  that,  if  the  target  variables  are  known  prior  to  inter¬ 
ception,  the  optimal  control  and  trajectories  may  be  calculated  in 
advance. 

The  time  required  for  computing  the  online  version  of  the  DEPN- 
3D  law  (the  lower  loop  in  Fig.  4)  is  found  to  be  mostly  just  over  5  s, 
and  always  under  6  s,  for  D  =  3,  NP  =  12,  Gmax  =  20  (the  values  arrived 
at  in  Section  4.3.1).  Hence,  the  first  intermediate  time  instant  tj 
(discussed  in  Section  4.3)  is  fixed  at  6  s.  For  the  problem  in  this 
paper,  since  convergence  always  occurred  within  6  s,  there  was  no 
need  to  consider  further  intermediate  time  instants  (t2,  t3, . . .). 

5.3.  Nominal  case  with  a  receding,  non-maneuvering  target  (Case 

Ua)) 

A  non-maneuvering  ( ayt  =  apt  =  0 ),  receding  target  is  assumed 
to  be  flying  away  from  the  missile  in  an  exact  tail  chase  initially 
(yt  =  0t  =  0°),  and  the  missile  has  very  high  initial  heading  errors  of 
hp — 30°,  hy  =  125°. 

The  performance  of  the  various  guidance  laws  under  nominal 
conditions  are  shown  in  Table  1  (Case  1(a))  and  Fig.  6(a)  and  (b). 
The  DEPN-3D  law  is  labeled  as  DEPN  in  all  the  graphs,  owing  to 
space  constraint.  The  performance  index  J  of  the  PPN  law  is  shown 
as  the  base  value  of  1 00%  for  easy  comparison.  It  is  seen  from  Table  1 
that,  both  the  online  and  offline  implementations  of  the  DEPN-3D 
perform  better  than  the  other  laws,  with  respect  to  three  of  the  five 
criteria  used  for  comparison:  tf,J  and  Vm(tf).  The  performance  of  all 
laws  as  per  the  fourth  criterion,  the  am(max)  is  the  same:  all  laws 
demand  the  limiting  value  of  the  latax,  am.  This  is  because  of  latax 
saturation  which  occurs  since  the  heading  error  is  very  high.  The 
fifth  criterion  of  the  limiting  values  of  target  maneuver  for  which 
capture  takes  place  is  inapplicable  in  the  nominal  case. 

Comparing  the  online  implementable  laws,  the  online  imple¬ 
mentation  of  DEPN-3D  is  constrained  from  performing  even  better 
by  the  fact  that  it  is  applied  only  from  ti  =  6  s,  and  it  is  the  PPN  that 
is  applied  until  then.  In  spite  of  this,  it  is  a  better  performer  than 
the  PPN  and  AAPN.  Fig.  6(a)  shows  how  this  is  achieved.  For  the 
same  latax  saturation  limit  of  15g=  147  m/s2  that  is  reached  by  all 
the  laws,  it  applies  more  pitch  latax  from  t,  onwards,  thus  making 
the  missile  rise  to  a  higher  altitude  in  the  atmosphere.  This  enables 
the  missile  to  take  advantage  of  the  reduced  drag  at  higher  alti¬ 
tude.  However,  this  has  to  be  traded  off  against  the  continuous  loss 


2  The  classic  laws,  PPN  and  APPN  are  online  implementable,  as  indicated  in 

Tables  1  and  2. 
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Case  1  -  receding  target,  initial  separation  =  3000  m,  hp  =  -30°,  hy  =  125°. 


PPN  (online  implementable) 

AAPN  (offline  only) 

DEPN-3D  (offline 
implementable) 

DEPN-3D  (online 
implementable) 

APPN  (optimal  law; 
online  implementable) 

(a)nom:  aT-0 

18.96 

18.19 

16.63 

17.86 

18.96 

(b)ayt0  =  -1.52g, 

20.71 

19.55 

17.59 

18.70 

19.63 

apto  =  +1.52g,~(B-t-E 
(c)aj,(o  =  -1.52g. 

,  most  unfavorable  mnvr) 

19.47 

18.43 

16.74 

17.70 

20.33 

t/(s) 

apt0  =  — 1.52g,~(B-t- 
(d)  ayt  =  -0.39g, 

B,  least  unfavorable  mnvr) 

21.45 

20.32 

18.21 

19.91 

20.97 

Opt  =  +0.39g,  step  (unfavorable  mnvr) 

(e)ayt  =  -0.39g,  18.04 

17.46 

16.59 

17.40 

18.25 

apt  =  -0.39g,  step  (favorable  mnvr) 

(a)nom:  aT  =  0  100% 

97.65 

92.0 

95.98 

100 

(b) -(B-t-B) 

=  2.387  x  105 

107.0 

103.35 

96.52 

99.87 

107.16 

K%) 

(cMB-t-B) 

103.9 

100.80 

94.01 

96.98 

104.52 

(d)=F0.39g,  step 

104.78 

101.93 

96.48 

100.67 

104.32 

(e)  -0.39g,  step 

96.69 

94.55 

91.41 

93.76 

97.07 

am(max.),  m/s 

(a)nom:  aT  =  0 

147  (=5m),  for  all  cases,  (a)-(e) 

381.64 

397.44 

446.15 

407.78 

381.64 

Vm(t/) 

(b)  —(B-t-B) 

350.08 

370.13 

419.36 

388.03 

367.97 

(m/s2) 

(c)— (B-t-B) 

368.77 

387.61 

435.19 

405.49 

355.15 

(d)=F0.39g,  step 

350.59 

366.93 

415.45 

377.07 

357.25 

(e)  -0.39g,  step 

396.48 

410.82 

447.25 

417.32 

391.12 

Capture 

(b)  —(B-t-B) 

1.52g 

2.52g 

3.73g 

3.93g 

1.85g 

(cMB-t-B) 

3.07g 

3.92g 

4.35g 

4.62g 

1.86g 

(d)=r.  step 

0.39g 

0.59g 

1.1 6g 

0.72g 

0.50g 

(e)-,  step 

2.09g 

2.09g 

2.1 5g 

2.10g 

1.87g 

of  kinetic  energy  due  to  drag  on  account  of  the  longer  flight  dura¬ 
tion  of  the  longer  path  involved  in  doing  so.  Thus,  the  climb  has  to 
be  only  to  an  optimal  height,  as  has  been  noted  in  the  literature 
[6,53,32,40],  The  DEPN-3D  law  accomplishes  this  automatically, 
given  the  model  of  the  atmosphere,  and  the  aerodynamic  model 
of  the  missile.  The  offline  DEPN-3D  faces  the  least  drag,  and  the 
PPN  the  most,  as  may  be  deduced  from  Fig.  6(b). 

The  offline  implementation  of  DEPN-3D  is  the  best  performer 
of  all  laws,  since  the  guidance  algorithm  can  be  applied  from  the 
beginning  of  the  engagement  (t=0),  and  the  tuning  of  the  coeffi¬ 
cients  a,  b,  and  c  is  much  better  than  the  default  values  of  these 
coefficients  in  the  AAPN  law,  which  is  also  an  offline  implementa¬ 
tion. 

It  is  seen  from  Table  1  that  the  terminal  speed  Vm(tf )  of  the  mis¬ 
sile  is  inversely  related  to  the  flight  time  tp  since  the  missile  loses  its 
speed  at  every  instant  of  flight  time  due  to  aerodynamic  drag.  Due  to 
the  shorter  flight  times,  the  DEPN-3D  laws  are  the  best  performers 
in  this  respect  as  well. 

A  complicating  factor  with  the  use  of  Euler  angles  in  3-D  engage¬ 
ment  scenarios  is  the  singularity  that  arises  for  some  values  of  the 
variables.  This  complication  arises  in  the  integration  of  (1 )  and  (3), 
as  for  instance,  when  the  angles  yt  and  ym  cross  90°,  making  0t 
and  0m  shoot  to  very  high  values  at  these  points.  This  difficulty  is 
tackled  by  limiting  these  values  to  |0.2|  at  these  points. 

The  trajectories  of  the  missile  and  the  target  have  been  included 
for  better  understanding  and  visualization  of  the  performance  of 
the  guidance  laws,  for  all  the  cases  in  this  paper. 

A  typical  convergence  curve  of  the  solution  for  the  online- 
implementable  DEPN-3D  is  shown  in  Fig.  5.  The  variation  in  the 
values  of ]  obtained  in  repeated  trials,  after  20  generations,  was  less 
than  0.1  %,  obviating  the  need  for  any  detailed  statistical  analysis. 

Fig.  6. 

5.4  Receding,  bank-to-bank  maneuvering  target  (Cases  1(b)  and 

(c)) 

The  true  test  of  any  guidance  law  is  when  it  faces  off-nominal 
conditions.  By  far  the  most  important  off-nominal  condition  that 


causes  performance  degradation  is  the  target  maneuver.  Unsurpris¬ 
ingly  then,  one  of  the  first  improvements  to  PN  towards  optimizing 
its  performance  was  to  address  this  issue  by  coming  up  with  an 
augmentation  term  involving  target  maneuver,  leading  to  the  APPN 
law  (see  Appendix  A). 

The  robustness  of  the  guidance  laws  in  this  paper  is  tested  by 
letting  the  target  undertake  unanticipated  maneuvers.  The  target  is 
assumed  to  be  undertaking  the  two  most  common  evasive  maneu¬ 
vers  in  practice:  bank-to-bank  (B-t-B),  and  step.  Further,  for  ease 
of  comparison,  the  target  is  assumed  to  be  maneuvering  equally  in 
the  pitch  and  the  yaw  planes. 

The  performance  of  all  the  laws  to  a  B-t-B  maneuvering  target  is 
considered  first.  The  results  are  shown  in  Table  1  as  Case  1(b).  The 
maximum  target  maneuver  magnitude  for  which  all  laws  intercept 
the  target  is  determined  to  be  1 .52g.  As  can  be  expected,  the  perfor¬ 
mance  of  all  the  laws  is  poorer  when  compared  with  the  nominal 
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M-T  trajectories  of  all  laws,  tail  chase 


Fig.  6.  Case  1(a)  -  receding  target,  initial  separation  =  3000  m,  hp  =  -30°,  hy  = 
125°,aM  =  15g. 


case  (Case  1(a)).  However,  the  pattern  of  results  for  the  nominal 
case  is  seen  to  be  repeated  in  this  case  too.  The  least  values  of  J 
is  for  the  offline  DEPN-3D,  and  the  maximum  value  is  for  APPN, 
which  are  slightly  more  than  that  for  PPN.  This  prima  facie  surpris¬ 
ing  result  is  readily  explained  by  the  fact  that,  armed  with  target 
maneuver  information,  it  compensates  the  target  maneuver  from 
t0  itself,  a  futile  effort  that  is  more  costly  in  terms  of  J  in  the  more 
practical  world  that  the  aerodynamic  model  represents  -  when  the 
target  is  far  away.  The  maximum  value  ofVm(tf)  is  for  offline  DEPN- 
3D,  and  the  minimum  value  is  for  PPN.  This  is  explained  by  the 
fact  that  the  DEPN-3D  has  the  least  flight  time,  and  hence  suf¬ 
fers  the  least  energy  loss  due  to  drag.  Similar  reasoning  applies 
to  PPN. 

The  maximum  magnitude  of  the  unanticipated  target  maneu¬ 
ver  for  which  the  target  can  be  intercepted  is  the  minimum  for 
PPN  (1.52g),  and  more  than  double  this  value  for  online  DEPN- 
3D  (3.93g).  Interestingly,  the  corresponding  value  for  the  offline 
DEPN-3D  law  (3.73g)  is  lower  than  for  the  online  DEPN-3D  law. 
This  is  explainable  by  the  fact  that,  for  a  sinusoidal  maneu¬ 
vering  target,  a  guidance  law  that  applies  higher  latax  from  to 
itself  is  applying  too  much  latax  too  soon,  expends  energy  in 


M-T  trajectories,  tail  chase,  off-nominal,  most  unfavorable  B-t-B  mnvr,  1.52g 


Missile  at  t=0 


Fig.  7.  Case  1(b)-  M-T  trajectories,  tail  chase,  off-nominal,  most  unfavorable  B-t-B 
mnvr,  1.52g. 


vain,  and  finds  itself  weaker  in  the  later  stages  of  the  engage¬ 
ment.  The  missile-target  trajectories  for  this  case  are  shown  in 
Fig.  7. 

Through  simulations,  the  B-t-B  maneuver  that  is  least  unfavor¬ 
able  to  interception  is  found  to  be:  both  ayt  and  apt  negative  for 
the  first  6  s,  of  the  opposite  sign  for  the  next  6  s,  and  so  on.  This 
case  is  shown  as  Case  1(c)  in  Table  1.  For  comparison  with  the  most 
unfavorable  case  (Case  1(b)),  the  magnitude  of  the  maneuver  is 
restricted  to  the  same  magnitude  of  1 ,52g  in  each  plane.  The  perfor¬ 
mance  of  all  the  laws  is  expectedly  poorer  than  for  the  nominal  case 
(Case  1(a)),  but  better  than  that  for  the  most  unfavorable  case.  The 
pattern  of  the  best  results  for  offline  DEPN-3D  and  poorest  results 
for  APPN  is  again  seen  to  be  repeated  in  this  case  too.  By  compensat¬ 
ing  for  a  downward  moving  target  from  to  itself,  the  APPN  law  loses 
altitude  and  hence  requires  even  more  time  and  energy  in  the  later 
stages  of  the  engagement  than  PPN  for  interception.  This  is  reflected 
in  the  values  of  tf  and  J.  Since  this  case  is  less  unfavorable  than 
Case  1(b),  all  laws  except  the  APPN  can  intercept  with  higher  target 
maneuver  levels  than  for  Case  1(b).  For  instance,  the  PPN  can  inter¬ 
cept  up  to  3.07g  for  this  case,  as  against  1 ,52g  for  Case  1(b),  and  the 
offline  DEPN-3D  can  intercept  up  to  4.62g  for  this  case,  as  against 
3.93g  for  Case  1(b).  The  corresponding  figures  for  most  unfavorable 
maneuver  and  least  unfavorable  maneuver  are  3.73g  and  4.35g  for 
the  online  DEPN-3D.  These  figures  suggest  that,  when  faced  with  an 
unanticipated  B-t-B  target  maneuver,  the  PPN  is  more  sensitive  to 
the  direction  and  magnitude  of  the  maneuvers  than  the  DEPN-3D, 
and  having  target  maneuver  information  that  is  compensated  from 
to  itself  like  the  APPN  does,  is  either  no  advantage  at  all  (Casel(b)), 
or  a  disadvantage  (Casel  (c),  in  a  realistic  scenario.  The  AAPN  is  seen 
to  be  less  sensitive  than  the  PPN,  and  more  sensitive  than  DEPN- 
3D,  as  per  this  criterion.  With  its  performance  against  PPN  itself  not 
being  so  favorable,  the  APPN  is  definitely  a  poor  performer  against 
the  DEPN-3D  laws,  as  per  four  out  of  the  five  criteria  chosen  for  com¬ 
parison,  the  exception  being  the  maximum  latax  that  is  same  for 
all  laws. 

The  logical  conclusion  is  that,  in  a  realistic  scenario,  a  more  intel¬ 
ligent  law  like  the  DEPN-3D  law  is  needed  for  optimal  performance, 
rather  than  a  basic  law  like  the  APPN  that  merely  nullifies  tar¬ 
get  maneuver  without  any  means  of  knowing  whether  doing  so 
is  beneficial  or  not. 
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M-T  trajectories,  tail  chase,  off-nominal, 


Fig.  8.  Case  1(d)  -  M-T  trajectories,  tail  chase,  off-nominal,  unfavorable  step  mnvr, 
0.39g. 


5.5.  Receding,  step  maneuvering  target  (Cases  1(d)  and  (e)) 

The  performance  of  the  laws  for  a  step  maneuvering  target  is 
studied  next.  By  trying  out  all  combinations  as  for  the  B-t-B  maneu¬ 
ver  cases,  the  most  unfavorable  step  maneuver  for  the  PPN  law  is 
determined  to  be  a  negative  value  of  ayt,  and  a  positive  value  for 
apt .The  maximum  magnitude  for  which  all  laws  intercept  for  this 
case  (Case  1  (d)  in  Table  1 )  is  found  to  be  0.39g,  which  is  just  more 
than  a  fourth  of  the  1.52g,  the  magnitude  in  the  most  unfavorable 
B-t-B  case  (Case  1(b)).  This  would  suggest  that  the  step  maneuver 
is  more  effective  than  a  B-t-B  maneuver  for  the  target.  Examining 
the  case  in  which  the  step  maneuver  actually  favors  interception 
by  the  PPN  (Case  1(e)),  the  maximum  maneuver  magnitudes  that 
allow  interception  are  less  than  those  for  Case  1(b)  or  Case  1(c). 
This  suggests  that,  at  least  for  the  high  heading  error  scenarios 
considered  in  this  paper,  the  step  maneuver  is  more  effective  than 
the  B-t-B  maneuver  from  the  target’s  point  of  view.  Conversely, 
from  the  missile  point  of  view,  all  the  guidance  laws  including 
the  DEPN-3D  law  are  more  robust  to  B-t-B  maneuver  than  the 
step  maneuver.  The  optimal  APPN  law  performs  marginally  bet¬ 
ter  than  PPN  law  for  Casel(d),  since  the  compensation  term  in  the 
latax  includes  an  upward  pitching  component  that  proves  bene¬ 
ficial  later  on.  But  in  Casel(e),  this  compensation  for  downward 
maneuver  makes  the  missile  lose  height  initially,  resulting  in  a 
performance  that  is  poorer  than  even  the  PPN  as  per  four  out 
of  the  five  criteria  for  comparison.  The  best  two  performers  are 
the  offline  DEPN-3D  and  the  online  DEPN-3D  for  the  step  maneu¬ 
ver  too.  The  missile-target  trajectories  for  this  case  are  shown  in 
Fig.  8. 


Latax's  of  all  laws,  head  on  case 


M-T  trajectories  of  all  laws,  head  on  case 


Fig.9.  Case  2(a) -approaching  target,  initial  separation  =  9000  m,  hp  =  -45°,  hy  = 
45°,  aM  =  15g. 


5.6.  Nominal  case  with  an  approaching  target  (Case  2(a)) 

The  scenario  of  an  approaching  target,  that  arises  more  often  in 
tactical  missile  guidance  than  that  of  a  receding  target,  is  considered 
next.  The  optimal  solution  for  the  AAPN  of  Sim  et  al.  [39]  has  been 
obtained  for  the  receding  target,  but  it  is  still  a  better  performer 
than  the  PPN  even  for  the  approaching  target.  Hence  it  is  retained 
in  this  paper  for  comparison,  as  a  useful  benchmark.  The  nominal 
case  of  a  non-maneuvering  ( ayt  =  apt  =  0),  approaching  target  flying 
exactly  head-on  to  the  missile  initially  (yt  =  0°,  0t  =  180°)  is  consid¬ 
ered,  as  in  Raghunathan  and  Ghose  [31],  The  initial  position  of  the 
missile  is  same  as  in  the  beginning  of  Section  5.2.  The  only  change 
is  the  initial  separation  xt,  which  is  assumed  to  be  9000  m,  instead 
of  3  000  m.  The  initial  heading  errors  are  assumed  to  be  hp  =  —45°, 
hy  =  45°. 

The  performance  of  all  the  laws  are  shown  in  Table  2  and  Fig.  9(a) 
and  (b).  In  this  case,  the  online  DEPN-3D  is  a  poorer  performer  than 
the  AAPN  by  all  the  criteria,  since  it  uses  a  slightly  lower  latax  than 
the  AAPN  until  £4  (Fig.  9(a)),  and  the  flight  time  remaining  after  ti  is 
less,  as  compared  to  Cases  l(a)-(e).  This  does  not  allow  it  the  scope 
to  improve  further.  This  is  also  seen  in  Fig.  9(b);  the  online  DEPN-3D 
trajectory  dips  as  low  as  that  of  the  PPN,  before  rising  somewhat  to 
improve  on  the  PPN.  Between  the  two  offline  methods,  the  offline 
DEPN-3D  is  expectedly  a  much  superior  performer  by  four  of  the 
five  criteria  ( tpj ,  Vm(tf )  and  the  max.  aT0  up  to  which  target  capture 
occurs),  since  its  design  uses  the  same  initial  engagement  geom¬ 
etry  with  the  same  realistic  engagement  model  with  which  it  is 
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Table  2 

Case  2  -  approaching  target,  initial  separation  =  9000  m,  hp = -45°,  hy  =  45°. 

PPN  (online  implementable)  AAPN  (offline  only) 


(a) nom:ar  =  0  12.60 

(b) Oyto  =  —2.86  g,  12.84 

Opto  =  -2.86  g,~-(B-t-B,  most  unfavorable  mnvr) 

(c)  Oyto  =  +2.86  g,  13.29 

Opto  =  +2.86  g,~(B-t-B,  less  unfavorable  mnvr) 
tf  (s)  (d)ayt  =  -9.16g,  17.36 

Opt  =  +9.16  g,  step  (unfavorable  mnvr) 

(e)dyt  =  -9.16g,  20.48 

Opt = -9.1 6  g,  step  (favorable  mnvr) 

(a) nom:ar  =  0  100% 

=1.363  x  105 

(b) -2.86-(B-t-B)  104.48 

J(%)  (c)+2.86~(B-t-B)  102.27 

(d) a=9.16g,  step  110.42 

(e)  -9.1 6  g,  step  1 25.09 

(a)nom:aT  =  0  133.06 

am(max.)  (b) -2.86 -(B-t-B) 

m/s2  (c)+2.86~(B-t-B)  144.53 

(d) =F9-16g,  step 

(e) -9.16g,  step 

(a)nom:  aT  =  0  586.57 

Vm  (tr)  (b)-2.86-(B-t-B)  562.33 

(m/s2)  (c)+2.86~(B-t-B)  570.17 

(d) a=9.16g,  step  471.08 

(e) -9.16g,  step  354.32 

Capture  (b)-(B-t-B)  2.86g 

upto  (c)  ~(B-t-B)  9.30g 

dT0(max.)  (d)  =f,  step  9.1 6g 

(e)-,step  20.37g 

a  NI  =  no  interception. 


12.37 

12.61 

13.03 

17.19 

20.01 


91.64 

96.11 

95.89 

110.71 

122.01 

123.40 
147(=  am) 
141.09 

147(=  am) 

604.98 

580.09 

588.51 

484.81 

366.70 

4.10g 

9.45g 

9.77g 

20.66g 


DEPN-3D  (offline  DEPN-3D  (online 

implementable)  implementable) 


11.49 

11.76 

12.07 

16.65 

17.66 


69.80 

67.59 

81.07 

111.37 

104.62 


147(=am) 


672.68 

650.45 

653.0 

527.41 

435.26 

20.42g 

19.93g 

21.98g 

22.73g 


12.53 

12.66 

13.26 

17.38 

19.78 


94.79 

90.02 

102.13 

114.31 
121.28 

147(=  am) 

141.37 

147(=am) 

117.77 

595.32 
581.93 
574.12 
474.02 

375.14 
15.99g 

20.21g 

19.10g 

20.13g 


APPN  (optimal  law; 
online  implementable) 
12.60 


12.56 

19.61 


133.06 

NIa 

127.15 


tested.  Whereas,  the  AAPN  was  designed  for  tail  chase  condition,  for 
a  kinematic  model.  This  superior  performance  is  due  to  the  much 
higher  latax  (equal  to  the  saturation  limit  am )  employed  by  the 
offline  DEPN-3D  from  initial  time  t0  itself  (Fig  9(a)). 

5. 7.  Approaching,  bank-to-bank  maneuvering  target  ( Cases  2(b) 
and  (c)) 

As  with  Case  1(b),  the  most  unfavorable  B-t-B  target  maneuver 
for  the  PPN  law  is  determined  by  trial  and  error.  For  this  case  (Case 
2(b)),  the  optimal  APPN  does  not  even  intercept  the  target  since, 
in  a  limiting  case,  it  nullifies  an  initially  beneficial  target  maneuver 
that  could  actually  facilitate  interception,  disadvantaging  itself  by 
losing  height  in  the  process  of  doing  so.  This  is  reversed  in  Case 
2(c),  in  which  the  missile  guided  by  APPN  gains  height  initially, 
and  hence,  achieves  interception.  For  the  remaining  three  laws,  the 
pattern  of  results  for  the  nominal  case  of  2(a)  is  repeated,  with  the 
performance  ranking  between  these  being  offline  DEPN-3D,  AAPN, 
and  online  DEPN-3D,  in  descending  order. 

5.8.  Approaching,  step  maneuvering  target  (Cases  2(d)  and  (e)) 

The  same  pattern  of  results  is  repeated  for  step  target  maneuver, 
with  the  APPN  in  Case  2(e)  being  similar  to  Case  2(b)  in  failing  to 
achieve  interception,  The  logical  explanations  of  why  these  happen 
are  also  similar  Case  1(e):  faced  with  an  initial  target  maneuver  that 
favors  interception,  the  APPN  nullifies  this,  since  it  has  no  means  of 
knowing  that  the  maneuver  is  actually  a  beneficial  one.  It  expends 
extra  energy  in  the  process  than  the  PPN,  and  hence  is  not  able 
accomplish  what  the  PPN  does,  for  what  are  limiting  values  of  tar¬ 
get  maneuvers  for  the  PPN.  Case  2(d)  is  somewhat  like  Case  2(c),  in 
being  able  to  achieve  interception:  the  latax  that  includes  the  com¬ 
pensation  term  for  enables  it  to  rise  in  altitude  initially.  Between 


the  remaining  three  laws,  the  offline  DEPN-3D  law  is  again  the  best 
performer  like  in  the  rest  of  the  cases  considered,  followed  by  AAPN, 
and  online  DEPN-3D. 

In  conclusion,  the  APPN  does  not  have  any  edge  over  the  PPN 
by  virtue  of  its  having  the  extra  information  of  target  maneuver. 
This  edge  would  be  available  only  if  even  more  extra  information 
of  whether  the  maneuver  is  beneficial  or  otherwise  is  available  for 
use.  The  DEPN-3D  laws  have  this  edge,  and  are  naturally  the  best 
performers  of  the  lot. 

6.  Conclusions 

A  completely  new  approach  to  optimal  missile  guidance,  based 
on  a  3-D  realistic  dynamic  model,  and  using  an  evolutionary  algo¬ 
rithm  that  is  online  implementable  has  been  presented.  By  and 
large,  the  DEPN-3D  laws  designed  by  using  this  approach  are  shown 
to  be  best  performers,  and  the  most  robust  to  target  maneuver,  in 
the  overall  analysis  of  all  the  laws  studied  in  this  work. 

A  comparison  of  the  proposed  laws  with  the  optimal  APPN 
reveals  that,  in  a  realistic  scenario,  a  more  intelligent  law  like  the 
DEPN-3D  law  is  needed  for  optimal  performance,  rather  than  a  basic 
law  like  the  APPN  that  merely  nullifies  target  maneuver  with  no 
means  of  knowing  whether  doing  so  is  beneficial  or  not. 

Unlike  the  closed  form  optimal  guidance  laws  that  require  time- 
to-go  information,  the  DEPN-3D  laws  presented  here  does  not 
require  time-to-go  information  for  its  implementation. 

A  detailed  study  of  the  effects  of  the  variation  in  the  tuning 
parameters  of  DE  may  be  conducted  as  future  investigation,  for 
issues  like  speed  of  convergence  and  faster  implementation. 

Given  the  fact  that  no  optimization  algorithm  can  work  equally 
well  on  all  classes  of  problems,  it  is  felt  that  there  exists  consid¬ 
erable  scope  for  studying  whether  there  can  be  a  better  match 
between  the  optimization  problem  in  the  paper  and  the  DE  variant 
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used  to  solve  it.  The  problem  can  be  studied  for  specific  features 
that  can  be  exploited  for  faster  optimization  by  the  state-of-the- 
art  DEs  that  have  appeared  in  the  literature.  Expanding  the  scope 
further,  would  be  interesting  to  compare  the  performance  of  other 
metaheuristic  methods  against  the  results  presented  in  this  paper. 
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Appendix  A. 

The  Augmented  PPN  (APPN) 

The  PPN  has  been  proven  to  be  the  energy-optimal  guidance  law 
at  least  under  some  conditions,  for  a  non-maneuvering  target.  For 
a  maneuvering  target,  this  optimality  of  the  PPN  can  be  preserved 
by  adding  an  augmentation  term  that  compensates  for  the  target 
maneuver.  The  compensation  term  includes  the  target  latax  normal 
to  the  LOS,  and  this  term  nullifies  the  rate  of  rotation  of  LOS  due 
to  the  target  maneuver.  This  law,  known  as  the  augmented  PPN 


(APPN),  is  given  by 

ayt,  APPN  =  %t,  PPN  +  ^Ndyt  COS  1 jft 

(48) 

apt,  APPN  =  apt,  PPN  +  2^apt  COS  0t 

(49) 

where  the  angles  ijft  and  9t  are  as  given  in  Fig.  1. 
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